Newest 'python reinforcement-learning deep-rl' Questions

1vote

1answer

122views

RL agent for autonomous vehicle is able to follow the road but can't avoid crashing at all (Highway-Env / Racetrack Env.)

I coded some deep RL algorithms (DQN and SAC) with tf2/keras to solve an environment where a vehicle needs to follow the track and avoid crashing into one other vehicle (there is only one other ...

rafiqollective

11

asked Nov 6, 2023 at 21:05

3votes

0answers

152views

Are there Reinforcement Learning algorithms specialized for the case $\gamma=0$?

I have a Reinforcement Learning problem where the optimal policy does not depend on the next state (ie gamma equals 0). I think this means that I only need an efficient exploration algorithm coupled ...

AJSV

31

asked Oct 15, 2022 at 7:33

1vote

0answers

318views

How to parallelize multi-agent DDPG (MADDPG)

I am experimenting with MADDPG algorithm implemented in this repo. Since there were only a few agents (2-3) in the implementation (also in the original paper) steps like parameter updates, action ...

Mika

361

asked Sep 5, 2021 at 22:06

1vote

0answers

130views

How should I simulate this Markov Decision Process?

I am working on solving a problem on nodes in a graph communicating with each other. They try to estimate a central state using Kalman consensus filter, with the connections described by the graph's ...

Suraj Joshi

11

asked Feb 19, 2021 at 17:27

3votes

0answers

66views

Deep Q-Learning agent poor performing actions. Need help optimizing

I'm trying to make deep q-learning agent from https://keon.io/deep-q-learning My environment looks like this: https://i.sstatic.net/EJHTD.jpg As you can see my agent is a circle and there is one ...

EnesZ

131

asked Apr 11, 2019 at 23:18

Stack Exchange Network

All Questions

RL agent for autonomous vehicle is able to follow the road but can't avoid crashing at all (Highway-Env / Racetrack Env.)

Are there Reinforcement Learning algorithms specialized for the case $\gamma=0$?

How to parallelize multi-agent DDPG (MADDPG)

How should I simulate this Markov Decision Process?

Deep Q-Learning agent poor performing actions. Need help optimizing

Hot Network Questions

All Questions

Related Tags